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Foreword 



The statistical mechanics of nonequilibrium stationary states of dissipative systems and, in particular, the 
large deviations of some specific observables attracted a lot of interest in the past decade. The literature on 
this problem is enourmous and it is impossible to give here a comprehensive list of references. 

Part of this interest concentrated on the fluctuation relation, which is a simple symmetry property of the 
large deviations of the entropy production rate (related to the dissipated power). The fluctuation relation 
is a parameterless relation and is conjectured to hold in some generality. The discovery of this relation [1] 
motivated many studies: and experiments specifically designed to its test have been reported on turbulent 
hydrodynamic flows [2, 3, 4] and Rayleigh-Benard convection [5], on liquid crystals [6], on a resistor [7], on 
granular gases [8]. 

Unfortunately, despite this great experimental effort, the situation is very confused, more than ten years 
after [1]. After many debates, numerical simulations established that the relation holds very generally 
for reversible chaotic dissipative systems: while experiments gave promising results but revealed also some 
difficulties in the interpretation of the data that generated many controversies. 

Indeed, the apparent generality of the fluctuation relation led to the idea that it could be tested "blindly" 
just by measuring the fluctuations of the injected power in some dissipative system. On the contrary, 
experience revealed that to test this relation one has to face many difficulties, and that each experiment has 
to be interpreted by considering its own specificities. 

Moreover, when applying the theory to a real physical system, one should obviously keep in mind that 
the real system does not coincide exactly with the mathematical model we use to describe it. So that, even 
if we expect that the theory will work for the mathematical model, its application to the description of the 
experiment might require further efforts. 

The aim of this paper is to review some of these difficulties, discovered in more than one decade of trials, 
in the hope that new experiments will be performed allowing to clarify many aspects that are still poorly 
understood. 

However, before turning to this discussion, it is important to review the theoretical developments that 
generated the current interest in the fluctuation relation. Indeed, this relation is the most accessible pre- 
diction of a much more deep theory, and discussing this theory in some detail will be very useful for the 
interpretation of the experiments. 

I will focus on the stationary state (or Gallavotti-Cohen) fluctuation relation. There are other fluctuation 
relations, such as the (Evans-Searles) transient fluctuation relation, the Jarzynski equality, and the Crooks 
fluctuation relation, which hold in much more generality. These relations have found many interesting 
applications in different fields, for instance biophysics of large molecules. However they are not directly 
related to the ideas discussed here. A review of these relations is beyond the scope of this paper. 

Everywhere in the paper I will only discuss the main ideas, giving references to the original literature for 
more detailed discussions. 

1 Why should we test the fluctuation relation? 
1.1 Definitions 

To begin, we will fix some notations. We will consider a dynamical system described by a set of state 
variables x = (xi, ■ ■ ■ , Xn) and evolving according to the equation of motion x = F(x). A trajectory of the 
system generated by an initial datum xq will be indicated by x(t), while segments of trajectory of duration 
t will be indicated by x(t), i.e. x(t) = {x(t),t £ [to,t + r]. Given an observable which is a function of the 
state x, O(x), we will indicate the average over a segment of trajectory by 




to 



(1) 
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and sometimes we will use the shorthand notations 0(t) = 0(x(t)), and O t = 0[x(t)], so that 

I ft +T 



1 fiOtT 

= - / dtO(t) . (2) 

T An 



'to 



1.2 The fluctuation relation 



The fluctuation relation was first introduced in [1]. There, a system of Lennard- Jones-like particles subject 
to a dissipative force (inducing shear flow) and to a thermostatting mechanism was investigated. The aim 
was to test the conjecture that the nonequilibrium stationary state reached by the system after a transient 
could be described by a probability distribution over the segments of trajectory x(t) having the following 
form 1 : 

A _1 [x(f)l p-TEi.+ Ai [*(*)] 

MM*)} ~ ^4r^ ~ — z — • (3) 

In the latter expression, A„[x(t)] is the expansion factor over the trajectory, related to the sum of all positive 
Lyapunov exponents computed on the segment x(t) (see Appendix A for a precise definition of all these 
quantities). The normalization factor is Z = X)x(t) ^-u 1 [ x (*)]- The proposal (3) originated from earlier 
studies of chaos and turbulence [9] and from periodic orbit expansions [10] 

Clearly a discretization in the "space of trajectories" x(i) is needed to give a precise mathematical sense 
to the expression (3). This problem is highly non trivial: a theory leading to an invariant measure of the form 
(3), based on Markov partitions to discretize phase space, was developed in [11, 12, 13, 14, 15, 16, 17, 18, 19], 
see also [20, 21, 22, 23, 24] and in particular [25, 26] for less technical discussions and [27] for a nice recent 
review of this problem. In this context the measure (3), seen as a measure on phase space 2 n{x), is called 
Sinai-Rucllc-Bowcn (SRB) measure. 

The system studied in [1] was described by reversible equations of motion. Then, it is straightforward to 
show [1, 17, 18, 19] that, if Ix(t) is the time-reversed of x(t), 

n{Ix{t)} e -T£ i|+ A<[/x(t)] e rE,,_A,[x (i) ] -e , w 

using the symmetry properties of the Lyapunov exponents under time-reversal, i.e. S» + -M^ x (*)] = 
— J2i - , see Appendix A. 

In (4) the phase space contraction rate a[x(t)] = — Aj[x(i)], equal to minus the sum of all Lyapunov 
exponents, appeared. This quantity can be easily measured in a numerical simulation, as it is possible to 
show (Appendix A) that 



cr[x(i)] = - [ dta 
T Jo 



(x(t)) , a{x) = -V • F(x) , (5) 

i.e. c(x) is minus the divergence of the right hand side of the equation of motion. Its average is <r + = 
(c(x)) = (a[x(t)]}. An infinitesimal volume dx will evolve according to -^.dx = —a(x)dx. 

It is possible to compute the probability of observing a value <r[x(t)] = a, P{cr) = P{cr[x(i)] = a}. Using 
(4) and a[Ix(t)} = -cr[x(t)], we get 

P(a)= Yl M{x(t)}= Yl ^{/x(t)} = e- ^ M {x(i)} = e™P(-a) . (6) 

x(t)|<r[x(t)]=<r x(t)|<r[x(t)]=<T x(t) |o-[x(t)]=-cr 



1 Thc expression (3) has to be intended in the following sense: if one is able to generate M segments Xj(t),j = 1, • ■ ■ , M 
of trajectory of duration t with uniform probability, then the average of an observable 0[x(t)] which is a functional of the 
trajectory can be computed as YliLi C[xi(t)]/i{xj(t)}/ 'Yl^L-i /i{xi(t)}. However this is not the case in an experiment: in this 
case the segments will be generated already according to the weight (3) and the average of an observable must be computed, 
as usual, as M~ 1 52 i=1 0[xj(t)]. This distinction was not clear in [1] and this generated some confusion in the subsequent 
literature. 

2 This is done by relating trajectories to initial data generating them, after a Markov partition of phase space has been 
constructed. In particular, x = x(0) is the point in the middle of the segment x(t) , t g [— r/2,r/2] and the limit r — > oo has 
to be properly taken. 
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The time-reversibility of the dynamics and the assumption (3) imply that P(u) should verify the symmetry 
relation (6), which is a first example of a fluctuation relation and was observed to be true in the numerical 
simulation of [1]. 

Some remarks are in order at this point. 

1. Eq. (3), and consequently (6), are the leading contributions to the probability for r — > oo, but for finite 
t corrections are present. 

2. As it will be discussed in detail in the following, if er[x(i)] is unbounded a more complicated analysis 
is required; thus it is convenient for the moment to restrict the attention to systems such that <r[x(t)] is 
bounded. 

3. It can be proven that a + > under very general hypotheses [28]. Moreover, if <r + = the SRB 
measure (3) reduces to the volume measure 3 and the system is at equilibrium. The relation (6) reduces to 
P{cr) = P(—cr). Thus in the following we will assume that cr + > and in this case the system will be called 
dissipative [25, 27]. In this case volumes will contract on average and the SRB measure will be concentrated 
on a set of zero volume in phase space. 

Under these hypotheses, defining the normalized variable p = cr[x(i)]/<7+, we expect that for large r the 
probability of p will be described by a large deviation function, i.e. that 4 

P(p<J+) ~ e T ^ (p)+o(T) . (7) 

As a is bounded, p is also bounded and for r — > oo we have \p\ < p*- the value of p* can be easily identified 
as we expect that Coo(p) = — oo for \p\ > p* , i.e. deviations bigger than p* have zero probability 5 in the limit 

T — ► OO. 

Using (6) and (7) we get the following relation [17, 18, 19]: 

Defining the large deviation function Coo(p) = lim T ^ 00 i log[P(er[x(f)] = p<r+)], and p* as the maximal value 
°f \p\ f or which Coo(p) > — oo, we have 

Coo(p) - Coo(-P) = PO-+ , \p\ < P* ■ (8) 

We will refer to the proposition above as the fluctuation relation. The two conditions that r is large and 
that p is bounded are often misconsidered but are very important and should not be forgotten [19] if one 
wants to avoid erroneous interpretations of (8). 

By looking at the sketchy derivation in (6), one realizes that the fluctuation relation can be extended in 
the following way. Consider an observable 0[x(£)] that is even or odd under time reversal, i.e. 0[x(i)] = 
±0[7x(t)], and the normalized variable q — 0[x(i)]/0+, where + = (0[x(t)]) is the average of O in the 
stationary state. Consider the joint probability P(p, q) ~ expr( OD (p, q). One can show [29, 30] that 

Coo(p,q) - Coo(-p,±g) =po-+ , \p\ < p* , \q\ < q* , (9) 

and the sign in (9) must be chosen according to the sign of O under time reversal. Eq. (9) can be extended 
to any given number of observables having definite (eventually different) parity under time reversal. 

The fluctuation relation attracted a lot of interest after [1] because it is a parameter-free relation that 
might hold in some generality in nonequilibrium systems. For this reason in the last decade it has been 
numerically tested on a lot of different models, often with positive result, and sometimes with negative or 
confusing results. In experiments the situation is complicated by the presence of many noise sources and by 
the difficulty to find a good modcllization of the system under investigation. 

One should keep in mind that, at least in this context 6 , the main theoretical motivation to study the 
fluctuation relation, that was already at the basis of the work [1], is to obtain some insight on the measure 

3 Or eventually to a measure which is dense with respect to the volume, fx(dx) = p(x)dx, e.g. the Gibbs distribution. 

4 By o(t) we mean a quantity Q T such that Q t /t — » for r — » oo. 
Note that the equality p* = max \a{x)\fcr+ is not true in general; obviously p* < max |<t(x)|/<T-|- is verified, but in general the 
value of p* is smaller because the observation of p = max \a(x)\/(j+ over a very long trajectory requires that cr(x(t)) = max |cr(x)| 
for all t along the trajectory, which is clearly very unlikely and might have zero probability for large r. 

6 As discussed in the foreword, the study of different fluctuation relations is beyond the scope of this paper. 
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describing stationary states of nonequilibrium system. This is clearly a very large class and it is possible 
to find nonequilibrium systems displaying any kind of strange behavior. We wish to identify a class of 
nonequilibrium systems such that the fluctuation relation holds in their stationary states. 

1.3 The "transient fluctuation relation" 

Subsequently after [1], it was noted that an apparently similar fluctuation relation holds in great generality. 
Namely, we can consider, instead of segments of trajectory drawn from the stationary state distribution, 
segments originating from initial data extracted from an equilibrium distribution (e.g. the microcanonical 
one). In other words, we extract initial data according to an equilibrium distribution and then evolve them 
using the dissipative equation of motion. If the equations of motion are reversible, it is possible to show 
that, if x(i) , t G [0,t] is the trajectory starting from x = x(Q) extracted with the equilibrium distribution 
one has [31] 

P eq {a[x(t)]=a} 

P eq {a[K(t)] = -a} ■ [ ! 

The latter relation has been called transient fluctuation relation and is very easy to prove: it follows from 
the definition of a, see Appendix B. It holds in great generality not only for the microcanonical ensemble 
but for many other equilibrium ensembles, see [32] for a review 7 . The fundamental difference between (10) 
and (8) is that in the former trajectories are sampled according to the equilibrium distribution of their initial 
data, while in the latter they are sampled according to the nonequilibrium stationary state distribution 8 . 

Then one can ask whether the fluctuation relation (8) can be derived starting from (10). Naively one 
could take the limit r — > oo of (10), claim that the initial transient is negligible and assume that (10) holds 
also for the stationary distribution. However, this is not at all trivial. Depending on the properties of the 
system, the probability P eq might not converge to a well defined probability distribution in the limit r — > oo, 
or the convergence time might be very large (practically infinite), and so on. 

Even if t _1 log P eq (p<j + ) converges fast enough to a smooth limiting function Coo(p), which will verify the 
fluctuation relation by (10), the latter may be different from the true function Coo(p) describing the stationary 
state [33]: indeed the measure (3) is concentrated on a set of zero volume in phase space if cr + > 0, and 
making statements on a set of zero measure starting from a set of initial data extracted according to the 
volume measure might be very difficult. We will give examples in section 4. 

Understanding in full generality what are the conditions that allow to extract the fluctuation relation 
(8) from the transient relation (10) seems to be a difficult task and is for the moment an unsolved problem. 
To simplify the problem and try to understand its main features, we can, following [17, 18], restrict our 
attention to a class of simple dissipative systems for which the fluctuation relation can be rigorously proved 
whithout making reference to (10), but directly from the invariant measure (3). 

1.4 Anosov systems and the fluctuation theorem: a proof of the fluctuation 
relation 

In [17, 18] a proof of the fluctuation relation was given for Anosov systems. The rigorous mathematical proof 
for Anosov maps (discrete time) is in [19], while the proof for Anosov flows (continuous time) has been given 
in [34]. 

Anosov systems are paradigms of chaotic systems. A precise mathematical definition is in Appendix C. 
Roughly speaking, they are defined by a compact manifold M (phase space) and a smooth (i.e. at least 

7 To avoid confusion note that the point of view on this subject expressed in [32] and in other papers by Evans and coworkers 
is very different from the one expressed here. 

8 It is interesting to remark that the phase space contraction rate is defined with respect to a given measure on phase space, 
see Appendix A. The fluctuation relation (8), as an asymptotic statement for large r, holds independently of the chosen measure 
(at least for smooth systems, see section 3), while the transient fluctuation relation (10) holds for finite r only if the phase 
space contraction rate is denned with respect to the equilibrium measure from which one extracts initial data. 
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Figure 1: A pictorial representation of an Anosov system. In the vicinity of a point x £ M it is possible to 
draw two families of manifolds M s >". The manifolds passing through x are the stable and unstable manifolds 
of x, M*'". Under the action of S, the manifolds MJ'" are mapped into the manifolds Mg™ passing through 
Sx. A point y £ M™ is mapped into a point Sy whose distance from Sx is larger by a factor <~ A, while a 
point z £ is mapped into a point Sz which is closer to Sx by the same factor A. 

twice differentiable) map S(x) acting on x £ M (the dynamics), having the following properties, see Fig. 1 
for an illustration: 

(1) Around each x £ M it is possible to draw two smooth surfaces M" and MJ such that points y £ 
approach x exponentially while points y £ M" diverge exponentially from x under the action of S (existence 
of the stable and unstable manifolds) ; 

(2) the rate of convergence (divergence) on the stable (unstable) manifold is bounded uniformly on M 
(uniform hyperbolicity); 

(3) the surfaces M^ u vary continuously w.r.t. x, and the angle between them is not vanishing; under the 
action of S the surfaces Mp u are mapped into the corresponding Mj'" (smoothness of the stable and unstable 
manifolds) ; 

(4) there is a point x which has a dense orbit in M (the attractor is dense). In this case the system is called 
transitive. 

In particular, 1) ensures that the system is chaotic; 2), 3) ensure that the system is smooth enough so 
that the measure (3) is well defined; and 4) ensures that the attractor is dense on M so that there arc no 
regions of finite volume that do not contain points visited by the system in stationary state. Note that as 
M is compact and S(x) is smooth, it follows that cr[x(t)] is smooth and bounded, as we assumed above. 

For Anosov systems it can be shown, by explicitly constructing a Markov partition [11, 12, 15, 24], that, if 
the initial data are extracted from the uniform distribution over M (i.e. the microcanonical distribution), the 
system reaches exponentially fast a stationary state described by the measure (3), which is called Sinai- Ruelle- 
Bowen (SRB) measure. The function ^ (p) exists and it is analytic and convex for p\ <p<pi [11, 12, 15]. If 
the system is transitive, dissipative (cr + > 0) and reversible, —p\ = p 2 = p* and Coo(p) verifies the fluctuation 
relation [17, 18] in the form (8). This is the Gallavotti- Cohen fluctuation theorem. 

Moreover, in this case the function r _1 log P eq (pa + ) converges to the limiting large deviation function 
Coo(p) describing the stationary state and satisfying the fluctuation theorem. 
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1.5 Stochastic systems (in brief) 

Alternatively, one can consider a stochastic model for a dissipative system. This was first done by Kurchan 
[35], who considered a Langevin system in presence of a dissipative force. He showed that Eq. (10) holds 
also in this case for any finite time r, and studied the conditions under which it converges to a well defined 
limiting distribution. It turns out again that the main requirements are some smoothness properties (see 
also [36] for a discussion) and the existence of a gap in the spectrum of the Fokker-Planck operator. The 
latter requirement implies that the probability distribution describing the system converges exponentially 
fast to the equilibrium distribution and that correlation functions decay exponentially, and can be considered 
as the counterpart of chaoticity in Langevin systems. In the context of stochastic systems, the fluctuation 
relation was also derived by Lebowitz and Spohn for discrete Markov processes [37]. Extensions and different 
perspectives were given by Maes [38], Crooks [39], Bertini et al. [40, 41], Derrida et al. [42], Depken [43]. 
The discussion of the stochastic case would require much more space but is beyond the scope of this paper. 

1.6 The chaotic hypothesis: a class of nonequilibrium stationary states verifying 
the fluctuation relation 

The general features that are behind all the existing derivations of the fluctuation relation are the following: 

1 ) Reversibility: the equations of motion should be reversible. This is the symmetry that is at the basis 
of the fluctuation relation, as it is clear from the derivation in (4), (6); 

2) Smoothness: the system must be smooth enough, and in particular the phase space contraction rate is 
assumed to be smooth and bounded; this ensures that p is well defined 9 and bounded and that it exists a 
value p* such that Coo(p) = — oo for |p| > p*; 

3) Chaoticity: the system must be chaotic in the sense of having at least one positive Lyapunov exponent 
(for deterministic systems) or of having a gap in the spectrum of the Fokker-Planck operator (for Langevin 
systems); chaoticity implies that the stationary state is reached exponentially fast and that the function 
t _1 log P eq (p<r + ) converges to a limiting distribution Coo(p); 

4) "Ergodicity" (or transitivity): by this we mean that the attractor should be dense in phase space, i.e. 
that the system must be able to visit any finite volume of its phase space in stationary state. The reason why 
this property is important will be discussed below. This property can be checked, for instance, by looking 
at the Lyapunov spectrum, see [44, 45] and references therein. 

All these ingredients seem to be important for the fluctuation relation to hold. It is worth to remark 
that properties 1)~4) cannot, obviously, be directly tested in an experiment. As often in physics, we are here 
making hypotheses on the mathematical properties of a model that we assume is able to describe the system 
under investigation. From these hypotheses, we then derive some consequences, in the form of relations 
between observables (e.g. the fluctuation relation), and only the latter are accessible to the experiment. 

As far as I know, for all investigated models satisfying 1)~4) the fluctuation relation has been succesfully 
verified. On the other hand, examples are known of systems violating at least one of the conditions above and 
for which the fluctuation relation does not hold. If one of the requirements above is violated a case-to-case 
analysis is needed. 

However, it is likely that experimental systems are described by models that violate some of the require- 
ments above, in particular the smoothness and ergodicity requirements. It is then interesting to investigate 
in more detail what happens in these cases to see if, under less restrictive hypotheses, we can still draw some 
general conclusions on how the fluctuation relation will be modified. 

In this context, a chaotic hypothesis has been proposed by Gallavotti and Cohen [17, 18]: it states that, 
even if hypothesis 1 )~4 ) cannot be proven to be satisfied in a strict mathematical sense, for the purpose of 

9 Note that an obvious requirement is that <r+ > 0, i.e. that the system is dissipative; otherwise p is not defined and the 
fluctuation relation reduces to the trivial identity P(cr) = P(— a). The limit ct+ — > is non trivial as we will see in the following. 
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computing the averages of some particular observables of physical interest, still the system can be thought as 
an Anosov system and its invariant measure can be assumed to be given by Eq.(3). 

It is worth to stress that, even if we accept the chaotic hypothesis, we should take care in drawing 
consequences from it. The violation of one of the hypotheses above will be observed if one chooses a suitable 
observable, e.g. by probing motion in extreme regimes (for example, looking at very large deviations of a, 
in a sense to be made precise below). Thus, in applying the chaotic hypothesis to a given system, one must 
take into account its peculiarities to avoid contradictions, in the same way one uses the ergodic hypothesis 
for equilibrium systems: see [27] for a review. 

In the following, we begin by reviewing the evidence in favor of the validity of the fluctuation relation 
for system that do not violate hypothesis 1 )~4 ) in a substantial way; at the same time we will discuss some 
difficulties that are encountered when trying to experimentally verify the fluctuation relation. Then we will 
discuss some classes of systems in which one among 1)~4) is not true and discuss what might happen in these 
cases. 

2 Verification of the fluctuation relation in reversible, smooth and 
chaotic systems 

First of all, we wish to discuss the difficulties that are intrinsically present if one wishes to test the fluctuation 
relation. At the end of this section, we will review the numerical simulations that attempted to verify the 
fluctuation relation in systems that are not proven to satisfy requirements 1)~4) of section 1.6, but at least 
do not seem to violate them in a substantial way. 

So let us assume for the moment that the system under investigation verifies the hypotheses of the 
fluctuation theorem (for instance, it is a reversible Anosov system), but we want to test the fluctuation 
relation numerically (to debug our program). 

The main problem is that to test Eq. (8) we need to observe (many) negative fluctuations of o> for large 
t. To be precise, one should construct the function ( T (p) — - log P(pa+) and check that it is independent of 
t in a given interval of p. If this interval contains p = 0, one can test the fluctuation relation in this interval. 
This is very important to guarantee that preasymptotic effects can be neglected. 

In general, a T will be proportional to the number of degrees of freedom (it is extensive). The system 
has to be dissipative, i.e. the average cr+ must be positive, otherwise the fluctuation relation is trivial. This 
means that the maximum of P{<J T ) (or equivalently of the function C T {p) defined above) will be assumed 
close to a T = a + (or p = a T ja + = 1). The fact that a T is extensive implies that in general Coo(p) oc N, 
where A is the number of degrees of freedom in the system. 

The function ^ (p) being convex, the probability to observe a negative value of p will be smaller than 
the probabity of p = 0. Thus we can estimate this probability as 

P [negative fluctuation] ~ e rCoo(0) . (11) 

The value of Coo (0) < 0, thus this probability will be very small as long as t is large. Moreover, Coo(0) oc A so 
the probability will scale as exp(— At). In general, we have 10 , for small er+, Coo(0) ~ cr+ = t ~ 1 No-q, where 
To is the microscopic characteristic time of the system and (To is the (adimensional) phase space contraction 
per degree of freedom over a time t . Finally we have 

P [negative fluctuation] - e -(r/r )Na„ _ ( 12 ) 
2.1 Entropy production rate 

In many models of dissipative systems it turns out, by explicit computation, that the phase space contraction 
rate is given by the power dissipated into the system divided by its temperature, i.e. it can be identified 

10 See the discussion after Eq. (19) and Appendix A in [46] for an explicit computation in a very simple case. 
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with an entropy production rate [47]. This is crucial to define an experimentally observable counterpart of 
the phase space contraction rate, if one wants to test the fluctuation relation. 

The theoretical discussion of this identification is beyond the scope of this paper. So we will take a more 
practical point of view. Assume, as in section 1.6, that we are able to build a model that we believe is able to 
describe the experimental system we wish to investigate. We can then compute the phase space contraction 
rate for this particular model. If this quantity turns out to be measurable in the experiment, we can use it to 
perform a test of the fluctuation relation. Otherwise, if the phase space contraction rate does not correspond 
to an observable quantity, we cannot use this system to test the fluctuation relation. However, this seems 
not to be the case at least for the models that have been considered in the literature: it always turned out 
that the phase space contraction rate could be identified with a measurable entropy production rate [47] . 

Accepting this identification, we can now give a quantitative estimate of the probability in (12) in a 
physical example. Consider an experiment done on a resistor; we apply a field E and measure the current J 
flowing trough the resistor. The dissipated power is EJ and the entropy production is EJ/kgT, where T is 
the temperature of the resistor. We average this quantity over a time r to obtain 

*r = ^ f f Q dtEJ {t ). (13) 

It is clear that the probability of observing a spontaneous reversal of the current must be very small. Let 
us estimate it using equation (12). We consider a resistor with R = 1 k£l, a current I — 1 mA, and an 
observation time t = 1 fj,s, at temperature T = 300K and with a microscopic time (i.e. the time between 
two collisions of electrons in the resistor) t <~ lCP 13 s. We get r/r = 10 7 and using fc^T ~ 10~ 21 J we get 
Na a = RI 2 T Q /k B T = 10 5 . Thus the final result is 

P[negative fluctuation] ~ e~ 10 , (14) 

which is clearly too small to be observed in an experiment. 

Nevertheless, Eq. (12) suggests some possible strategies to enhance negative fluctuations: 1) reduce the 
observation time; 2) consider a smaller system; 3) reduce ct . Unfortunately, 1) is limited by the fact that the 
fluctuation relation holds only for large t, i.e. r 3> t ; so we can reduce r, but at best we can use t <~ 100t 
if we do not want to observe finite r effects [48, 49]. Note also that in experiments r is strongly constrained 
by the acquisition bandwidth. In any case, given that in our example Na ~ 10 5 , even with r/r = 10 2 we 
will not gain much. Reducing the system size is a very good idea (but might be difficult in experiments): 
for this reason numerical simulations to test the fluctuation relation are usually done for systems of N < 50 
particles. However, in some cases we might be interested in large systems if we want to test the chaotic 
hypothesis, as the deviation from Anosov behavior might be more evident in small systems. Finally, we 
might try to reduce the entropy production in the system. This can be done by reducing the strength of the 
applied field (the electric field in our example). There is a problem however: in the limit of small dissipation, 
the fluctuation relation reduces to the Green-Kubo relations that are well known from linear response theory 
[50, 51]. Therefore our experimental test of the fluctuation relation will reduce to a trivial test of linear 
response theory, up to 0(E 2 ) corrections. We will discuss the relation between fluctuation relation and 
Green-Kubo relations in the following section. 

2.2 Gaussian fluctuations and the Green-Kubo relation 

The fact that the fluctuation relation reduces to Green-Kubo relations in the small E limit is disturbing 
for the purpose of testing the former, but is an important check of the consistency of the theory. Indeed, 
a good statistical theory of nonequilibrium stationary states should reduce to linear response theory in the 
limit of small dissipation. The relation between the fluctuation relation and Green-Kubo relations has been 
discovered in [29, 30]: in the following we will closely follow the original derivation of [29]. 
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2.2.1 The large deviation function of a close to equilibrium 

Close to equilibrium the istantancous entropy production rate has the form a(t) = EJ(t) [51, 52], as 
discussed above, where E is the driving force and J — J/(fcsT) is the conjugated flux, see Eq. (13). To 
compute the function Coo(p) it is easier to start from its Legendre transform z(X), defined by 

Zoo(A) = - lim r" 1 log(exp[-Arcr r ]) = -max[Coo(p) - Apcr+] . (15) 

r — >oo p 

The function z OQ (X) generates the connected moments of cr T ; using Eq. (13), in the limit r — > oo, these have 
the form 11 



z (k) = 



oc 



= {-l)^ 1 lim T k - X U k ) 

r r/2 ,t/2 

dti-- - / 

r/2 J-r/2 



(16) 



(_!)*-!£;* lim T -i /" . . . f dtk (j^) . . . J{f k )) 

T ^°° i-r/2 J-t/2 



The connected correlations {J{ti) ■ ■ ■ J{tk)) c are translationally invariant due to the stationarity of the 
system and decay exponentially in the differences \U — tj\ due to chaoticity. 

From stationarity it follows that Zoo = E (J) and as (J) = in equilibrium, one has (J) <~ E and 
Zoo — <t+ ~ E 2 . Using stationarity and the exponential decay of the connected correlations one has, for 
fc>l, 



lim t- 1 / dti ■ ■ ■ / dt k (J(h) ■ ■ ■ J(t k )) c = 

T ^°° J-t/2 J-t/2 

/oo />oc 
dh ■ ■ ■ / di fc _! ( JMJih) ■ ■ ■ J(t k -l)) c = J™ , 
-oo J —oo 



(17) 



z^AJ^WA+^+O^A 3 ). (18) 



and the Jo? are finite for E — > 0. This means that the zl? ~ £? fc for fc > 1 and 

(2) 

2 

Using this result and the relation Coc(p) = minA[Apcr+ — Zoo(A)] one can prove that 12 

Coob) = ^(p - I) 2 - (P -1) 3 + --- = ^|)(P-1) 2 + 0(i? 3 b - I) 3 ) , (19) 

i.e. that Coo(p) is approximated by a Gaussian up to |p — 1| ~ 1/E, i.e. in an interval whose size grows for 
E — > [29, 30]. Note that from Eq. (19) we also have Coo(0) ~ — ~ E 2 ~ er + as anticipated. 

2.2.2 The Green-Kubo relation 

If the function Coo(p) is given by the first term in (19), the fluctuation relation immediately gives 

1 1 C°° 

a + = --z£ => a + = (o-(t))=-E 2 dt (J(t)J(0)) c , (20) 



11 It is convenient for this computation to change convention and define o> = r 1 dta(t)\ one can check that due to 

time translation invariancc the results arc unchanged by the convention used. 
12 Notc that z£} is negative. 
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using (17) for k = 2. Recalling that J(t) = J{t)/T, a(t) = EJ(t) and (J(t)J(0)} c is even 13 in t one obtains 

P f°° 

(J) = -J o dt (J(t)J(O)) , (21) 



and to the lowest order in E 



E 1'°° 

(J)e = ^J dt (J(t)J(0)) E=0 + 0(E 2 ) , (22) 
that is exactly the Green-Kubo relation [50] . 



2.2.3 Extension to many forces: Onsager reciprocity 

If many forces are present, we will have close to equilibrium a(t) = EiJi(t); the derivation above can be 
repeated and from the fluctuation relation a + = —\z£?, using 



z (2) 



/OO 
dt{Ji{t)JM) > (23) 
-oo 



we get 



1 f°° 

a + =^(Ji) =-Y,E i E j J dtiJiWjW) . (24) 



Defining the transport coefficient /xy from (Ji) = J^j PijEj, and 14 = L^j = | dt{Ji{t)J 3 ;(0)), we get 
from (24) the relation 

2 = L ij ■ ( 25 ) 

However we would like to prove directly the Green-Kubo relation /i^ = Ljj , and Onsager reciprocity ^ = 
Hji. This can be done by using the generalized fluctuation relation (9) for the joint distribution of a and Ji 
[29, 30]. Using Eq. (9) with p = a T /a + and 



1 



72 

dtJi(t) , (26) 



T(Ji) J-rf 

and performing a computation similar to the one in the previous section (see [29] for the details), we get 
Eq. (23) and the additional relation 

1 f°° 

W) = oE^/ dt{Ji{t)JM) , (27) 
which is the Green-Kubo relation for Ji [50]. This implies ^ — and Onsager reciprocity 14 fiij — fiji. 



2.3 Summary and a review of numerical results 

We have seen that even in the case of systems that are guaranteed to verify the fluctuation relation, a 
numerical or experimental test might be difficult. This is due to the fact that: 

• if we wish to verify the fluctuation relation indipendcntly of linear response theory, we have to apply a 
large field, such that the nonlinear terms in (19) are visible for p <~ 0: otherwise, if the function Coo(p) 
is Gaussian up to p = 0, the fluctuation relation reduces to the Green-Kubo relation; 

13 This simply follows from time translation invariancc, (J(-t)J(0)) c = (J(0)J(-t)) c = (J(t)J(0)) c - 
14 The equality Lij = Lji follows from time translation invariance, see footnote 13. 
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Figure 2: Two examples of the function Cx>(p) measured in a numerical simulation [49]. Open circles: Coo(p)- 
Filled squares: Coo(— p) +P&+- There are no free parameters. A Gaussian fit close to p — 1 is reported as 
a full line. Deviations from the Gaussian are seen close to p — 0. In the inset an enlargment of the region 
where circles and square overlap, allowing for a test of the fluctuation relation, is reported. It was checked 
[49] that in this interval of p the function Cr(p) did not depend on r, so that it is equal to the true limiting 
function (oo(p)- (Left) N = 8 Lennard- Jones particles in d — 2 subject to a constant force (electric field) 
with periodic boundary conditions and with a reversible Gaussian thermostat. (Right) Similar system with 
N = 20 and d = 3. See [49] for details on the simulation. 

• however, if E is large, |C(0)| oc E 2 is also large, and the probability to observe negative events becomes 
very small; 

• the system size must be small, otherwise again |£(0)| oc N is large: this can pose problems if we expect 
the fluctuation relation to hold only for large enough N; 

• the time r must be large enough for having an interval of p where the function £ T (p) does not depend 
on r; but if r is too large, again negative values of p are less probable and the interval might not 
contain p = 0. 

Despite these difficulties, many attempts were made to test the fluctuation relation in numerical simu- 
lations of smooth enough, chaotic and reversible systems which were not guaranteed to strictly satisfy the 
requirements of section 1.6 [1, 48, 53, 54, 55]. Unfortunately in most of these tests the distribution P(a) was 
found to be Gaussian. However these works were very useful for the development of the theoretical concepts 
discussed in the previous section, that were largely motivated by the numerical results. 

In [49] it was noted that, using Eq.s (19) and (20), it turns out that the non Gaussian term in (19) is 
proportional to jlk. E 3 (p — l) 3 ; this suggested that, keeping fixed E to have a small er+, one could increase 
the non Gaussian tails of Coo {p) by increasing j7ob , which is related to the nonlinear part of the transport 
coefficient. In a fluid of Lennard-Jones like particles, the nonlinear response is observed to increase on 
lowering the temperature: this fact was exploited in [49] where it was possible to verify the fluctuation 
relation in a numerical simulation on a non Gaussian Coo(p)j see figure 2. Note that even if the parameters 
were carefully chosen, the finite r corrections to the function Coo(p) had to be taken into account to extract 
the correct asymptotic behavior for large r. In particular in the case of asymmetric distribution the first 
order correction for large r is a shift of p. Thus one can reduce the error by shifting p in such a way that 
the maximum of Ct{p) is assumed in p = 1, as we expect in the limit r — > oo. See [49, 55] for a detailed 
discussion. 
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This problem is specific of systems which admit a natural equilibrium state and are driven far from 
equilibrium by some perturbation, such as a Lennard-Jones fluid. There are however systems (e.g. the 
Navier-Stokes equations) such that one cannot reach an equilibrium state by tuning some parameter. For 
these system there is no obvious linear response theory, and a test of the fluctuation relation is stringent even 
if the distribution is found to be Gaussian. In the case of a reversible version of the Navier-Stokes equations 
(to which the chaotic hypothesis can be applied), successful numerical tests of the fluctuation relation were 
performed in [56, 57, 58]. 

The numerical results cited above strongly support the claim that the chaotic hypothesis can be applied 
to systems verifying the requirements of section 1.6. Moreover a new very efficient method to sample large 
deviations has been proposed in [59] and applied to test the fluctuation relation in some systems belonging 
to this class, with very promising results. 

3 The effect of singularities: non-smooth systems 

Despite these successes, a number of simulations performed using different ensembles found apparent viola- 
tions of the fluctuation relation, see e.g. [60] and references therein. These violations where later recognized 
to be due to the presence of singularities of the Lennard-Jones potential used in the simulation, that were 
affecting the measurements in a subtle way [61, 62, 63]. In a system of particles interacting through a 
Lennard-Jones potential, the potential energy can be arbitrarily large if two particles are close enough to 
each other. If energy is conserved, this cannot happen, but it can happen, for instance, if only the kinetic 
energy is kept constant. This is why violations were observed only when using the isokinetic thermostat. 

Consider the power injected into the system by the external forcing, W{t), and the heat per unit time, 
Q{t), dissipated by the thermostat. Energy conservation implies Q(t) — W(t) — E(t), where E = K + V is the 
total energy. Thus if the latter is conserved, Q = W, while if kinetic energy is constant, Q(t) = W(t) — V(t), 
where V(t) is the potential energy in the system. The entropy production rate, o~{t), can be defined as 
&w(t) — W(t)/T or as ctq(£) = Q(t)/T; there is no a priori reason for choosing one of the two definitions. 
The difference is a total derivative that has zero average in stationary state, so the average a + is unaffected 
by the choice. 

If one considers a microscopic model of the system, it turns out that the phase space contraction rate, as 
computed from the equations of motion, can be given by any of the two definitions above, depending on which 
metric one uses to measure distances in phase space. In fact, the phase space contraction rate is defined with 
respect to a given metric (i(x)dx and if one switches from n{x) to fj,(x)e~' p ^ the phase space contraction rate 
is changed by 0, see Appendix A. For instance, in the case discussed above, the phase space contraction rate is 
given by W(t)/T if one considers the contraction of the measure /i(p, q)dpdq = e~^ v ^ 5(K (p) — 3NT /2)dpdq, 
(p,q) being the momenta and positions of the particles, and by Q(t)/T if one considers the contraction of 
the volume measure dpdq. 

What is the effect of the total derivative VI If we consider the integrated variables, 



If V(q) is a bounded function, |V(g)| < B, the difference vanishes as 1/t for large r. This happens uniformly 
on phase space and it follows that the large deviation functions of aw and ctq are equal. Thus, for what 
concerns asymptotic statements such as the fluctuation relation, the two definitions are equivalent. The 
fluctuation relation does not depend on the measure one chooses to compute the phase space contraction 
rate, as anticipated in section 1, as long as a is smooth and bounded. 

3.1 The effect of an unbounded total derivative 

Let us now see what happens if the function V is not bounded. The effect of an unbounded total derivative 
term was first discussed in [61] and [64, 62]. Extensions were discussed in [63, 65, 36, 66]. Here we will follow 




(28) 
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Figure 3: The case of exponential tails. In this case (w(p) = — 4 verifies the fluctuation relation for 
<t+ = 1, and A = 1, hence -P(v) = e - ". The corresponding function Cq(p) coincides with Cw{p) only for 
p G [—1, 3], and does not verify the fluctuation relation for p > 1 [62, 7]. 



and extend the derivation in [63]. As part of the computation is unpublished, all the details are given in 
Appendix D. 

We consider the normalized variables pq — <jq t /(t+ and pw — cwt/c+i verifying the relation 

PQ=Pw + ^— 11 , (29) 
r 

having defined the function v(t) = V(t)/(Ta + ). The potential energy is usually bounded from below, 
V(t) > B. As only differences of V matter, we assume that B — by a shift of V, without loss of generality. 
We assume that pw has a well defined large deviation function, 

Pr{pw) ~ e T ^™) , (30) 

and we define 

P( v ) = e -/(») , (31) 

the probability distribution of both Vi, vt > 0. 

We assume that pw, and Vf are independent; this follows from the chaotic hypothesis [63], because 
for large r, Vi oc V(0) and Vf oc V(t) must be uncorrelated, and pw is an integral of a function evaluated 
for t £ [0, r] which is independent of V^(0), V(r) for most times t. We have then 



^r(pQ) 



T 2 



dpvF y dv l dv } e Tiw{pw) ~ f{v ^- f[v f ) 5 [pq -p w - V% ~ Vf 

oo 



(32) 



where we performed the integration over p\y using the delta function and changed variable from Vij to 
Vij/r. For large r we evaluate the integral at the saddle point and we obtain 

Cq{pq) = -logP T (p Q ) ~ max\( w (pQ - v t + v f ) [/(rui) + f(rvf)]\ . (33) 

T Vi,Vf T J 

The solution of the saddle point equations, detailed in Appendix D, gives the following asymptotic behavior: 
• for superexponential tails, f(v) ~ Av a , a > 1, one obtains Cq(pq) = Cw(pq) + 0(t _1 ) for all pq\ 
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for exponential tails, f(v) ~ Av, we get 

{(w{P-)+A(PQ-P-) P<P-, 
(q(pq) = I (w(pq) P- <P<P+ , (34) 

[Cw(p+) - A(p Q -p+) P>P+, 

where p± are defined by Cw(p±) = TA i.e. £q coincides with Qw for p e and outside this 

interval it is given by its continuation by straight lines with slope ±A 

for subexponential tails, f(v) <~ Av a , < a < 1, the values p± are defined by \p± — 1| = S <~ t~2=^ : 



r 2 - a 
o = 



ia(a - 1) 



(35) 



and 



Cq(pq) - i „_ Q -i, 1|Q i_ ( 36 ) 



for any finite (i.e. independent of r) 7 » (5. In the intermediate regime 6 < \pq — 1| < 7 the saddle 
point solution has to be calculated numerically to interpolate between the two regimes. The function 
Cq(pq) tends to zero for any pq ^ 1 and coincides with (w only in a small interval around pq = 1, 
whose amplitude shrinks to for r — > 00. 

for power-law tails, f(v) ~ /31ogw, we get the same as above with 



|p± - 1| = 2 ' / -<|(i)^ <37) 

and 

Co(Po)~--log[T|p -l|] (38) 
r 

for |pq — 1| > 7 for any finite 7. 

In brief, total derivative terms with superexponential tails are irrelevant, while exponential tails give a finite 
modification of £q , and subexponential tails give (q — > for all pq . This makes evident that total derivatives 
might have a dramatic effect. 

Assume for instance that in the previous example the function C,w satisfies the fluctuation relation. If 
V has exponential tails, from Eq. (34) it follows that the function (q satisfies the fluctuation relation only 
for \p\ < min(p_,p + ), while for large p the function (q(p) — (q{~p) ~ const, see Fig. 3 for an example. 
Even worse, in the case of subexponential or power-law tails, (q satisfies the fluctuation relation only for 
\p\ < —p-, with p_ — > 1 for r — > 00. Thus for large enough r the fluctuation relation is violated for all p, 
see Fig. 4, and (q(p) — Cq(~p) ~ * 0- It is interesting to remark that if (w is convex, (q is not if the tails 
are subexponentials. 

The case of exponential tails is particularly relevant because in many cases v oc V(q), the potential 
energy, whose distribution has tails e _/3v , i.e. exponentials. Cases in which the tails are subexponentials 
might be also relevant for experiments: for instance the anomalous tails in [4, 5, 8] might be explained by 
this computation. 

3.2 Removing unbounded total derivatives 

The conclusion of the previous section is that terms that are apparently small may affect dramatically the 
large deviations of a in presence of singularities. Still the two definitions of entropy production rate seem a 



16 




Figure 4: The case of power-law tails. (w{p) is as i n Fig- 3, while f(v) = 21og(u), i.e. P(v) ~ v~ 2 . The 
function Cq(p) is reported for r = 3, 10,30. In this case for r — ► oo, Cq(p) ~~ * 0- The fluctuation relation 
(right panel) is verified only for small p at small t. For r = 30 one has p_ > and the fluctuation relation 
is never verified. Note the similarity with the curves in Fig. 3 of Ref. [4]. 

priori equivalent from a thermodynamic point of view, and have the same average <r+ and the same moments 
(o~ k ) for any finite k (the moments being related to derivatives in p = 1 of Cx> (??))■ 

How can we identify a priori the correct microscopic definition of a, satisfying eventually the fluctuation 
relation? A possible prescription has been discussed in [63] and is based on the proof of the fluctuation 
relation for continuous-time systems given in [34]. The proof is based on a Poincare's section mapping the 
continuous-time flow into a discrete-time map. One considers a set of timing events: for instance, in a system 
of hard spheres, the times at which two particles collide, or, similarly, in Lennard- Jones systems, the times 
at which the total potential energy equals a suitable value Vq. In such a way the flow is reduced to a map, 
and the proof of the fluctuation theorem for maps [19] can be applied. However, the Poincare's section, i.e. 
the definition of timing events, has to be chosen carefully, in such a way that the resulting map is an Anosov 
map. This has been rigorously done for Anosov flows in [34]. 

For realistic systems this is clearly very difficult. Anyway, it seems very natural, if the system has 
singularities, to choose the timing events in such a way that singularities are avoided. For instance, if the 
potential energy V(t) of the system can become infinite in some points of phase space, one can choose the 
timing events t n by V(t n ) = Vq. In the example of the previous section, Eq. (28), the two definitions of a 
become equivalent on the Poincare's section. More generally, if singularities are avoided by the Poincare's 
section, all possible definitions of the phase space contraction rate a(t) differ by a bounded total derivative 
which gives no contribution in the limit r — * oo. 

In this way we can remove the ambiguity on the definition of <r, at least in the limit r — > oo. Note 
that this prescription does not guarantee that the phase space contraction for the discrete-time system is 
bounded. The possible values of the latter quantity are given by 

o-n = 1 —- f + o-(t)dt , (39) 

tn+1 t n 

and if o~(t) is not integrable close to a singularity a n can be unbounded. This happens clearly if a(t) oc 
(t — to) 1 , 7 < 1, for a singularity to £ [t ni t n+ ±]. In most interesting applications, e.g. driven Lennard- Jones 
systems in contact with a thermostat, it is possible to show that aw is bounded on all phase space, so that 
the singularities come only from the total derivative and are obviously integrable as in (28). In these cases 
the prescription on the Poincare's section is enough to remove singularities, and if the system verifies the 
conditions 1), 3), 4) of section 1.6, i.e. it is reversible, chaotic and has a dense attractor, the fluctuation 
relation is expected to hold from the chaotic hypothesis. 
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3.3 A proposal for data analysis in presence of singular terms 



To summarize, singularities might have important effects but these can be controlled, following the compu- 
tation and the prescription of the previous sections. 

First of all we note that the presence of unbounded terms is manifested by anomalous tails in Coo(p), 
exponentials or possibily subexponentials. It might also be manifested by the fact that Cx>(p) is not convex. 
A fit to the tails of the measured (p) allows to guess the behavior of the tails of the singular term, as the 
behavior of Coo(p) for large p is the same as the behavior of f(v) for large v. 

Assume that we have performed an experiment or a numerical simulation whose output is a very long 
time trace cr(t), t e [0,T], where one has chosen a definition of er(t) in terms of work, heat, etc. The usual 
procedure to verify the fluctuation relation is to fix a time delay r, and integrate a{t) over subsequent 
segments of the trajectory to obtain values of a T — r _1 J t ° T a(t)dt. One then constructs the histogram 
P((T T ) and the function Coo(p) which is used to test the fluctuation relation. 

In presence of singularities, the procedure must be modified as follows: 

• One defines in a suitable way 15 a set of timing events on the trajectory, t ni such that a(t) is not 
singular for t = t n , and that the average difference between two subsequent timing events is finite, 

TO = - t n ) < CO. 

• One fixes a delay k and constructs values of 



• Then one constructs the histogram Pk{<Jk) and the large deviation function £fc(p) = k 1 logPfc(j><7+), 
where er + = (cfc); the latter can be used to test the fluctuation relation in the limit k — > oo. 

If the system is reversible, chaotic and has a dense attractor, the function Cfc(p) should have a finite limit 
that will be convex and verify the fluctuation relation (8) for large k, according to the chaotic hypothesis. 

Then, to check the consistency of the analysis, one can recompute the original large deviation function 
Coc(p)- Indeed a guess on the tails of the function Vij can be done by looking at the tails of the function 
Coc(p), because as we discussed above the two functions have the same tails. Using this guess, one can apply 
the formulas of section 3.1 and check the self-consistency of the analysis. 

The analysis of this section has been confirmed in some cases by numerical simulations [48, 49, 67], and 
experimentally [7], and seems promising to interpret recent experiments on turbulent systems [4, 5]. 

Unfortunately, if a is not integrable between two timing events, i.e. (39) is not bounded, at present no 
general statement can be made. However this seems to be a very pathological case which should not be 
realized in most physical examples. 

4 On the convergence of the transient fluctuation relation to the 
stationary state fluctuation relation 

As we discussed in the introduction, even if it may seem that the transient fluctuation relation implies the 
fluctuation relation for stationary state in the limit r — > oo, this is not the case in general. The transient 
fluctuation relation holds in greater generality than the stationary state fluctuation relation. In this section 
we will discuss some examples that will highlight the importance of the properties of chaoticity and transitivity 
(i.e. the property that the attractor is dense in phase space) to ensure the validity of the fluctuation relation. 

15 One should take care here in order to avoid introducing biases in the sampling of cr fc (I thank A. Puglisi for this remark). 
For a system of particles one can for instance define as timing events the instants where a (suitably defined) "collision" takes 
place. If one is able to identify the source of singularities, i.e. what is the term in the phase space contraction rate that gives 
unbounded contributions to cr, better Poincare's sections can be eventually constructed. 




(40) 
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As we said in the introduction, chaoticity is important because it guarantees that initial data sampled 
with respect to the volume measure produce trajectories converging fast enough to the stationary state. 
Indeed, in experiments the system is often prepared at equilibrium, then the driving force is turned on and 
the system is let evolve toward the stationary state. If convergence to the stationary state is not fast enough, 
the observed trajectories might not be representative of the real stationary state, and confusing results may 
be obtained. 



4.1 Examples of systems such that P eq does not converge to the stationary state 
distribution 

4.1.1 A simple example 

A very simple example in which the distribution P eq does not converge to the stationary state distribution 
has been given in [33]. In this example the transient fluctuation relation holds for any finite time r, including 
the r — > oo limit, for P eq , but the real stationary state distribution trivially violates the fluctuation relation. 

The model describes a free particle evolving in two dimensions under the action of a constant force E 
and of a thermostatting force keeping its kinetic energy constant. If p is the momentum of the particle, its 
mass is 1 and we fix \p\ 2 = 1, the equation of motion are 

$=E-(E-p)p, (41) 

and introducing the angle 9 by p ■ E = Ecos9, where E — \E\, we can write the equation for 9: 

6 = -E sin 6>. (42) 

The dissipated power is given by W(t) = E ■ p(t) = E cos 9(t) and is equal to the entropy production rate, 
the "temperature" \p\ 2 being equal to 1. Note that W{t) = d9/d9 is the phase space contraction rate for 
Eq. (42). 

The equation of motion (42) with initial datum 9q is easily solved, 



tan^ = e~ Et ta,n^- . 
2 2 

The entropy production rate over the trajectory is given by, defining s(t) — tan6>(i)/2, 

1 r ir „, , If, 9Ecos9(t) 1 f m d9 

a T = - / dtEcosOit) = - / dt -V4 = / 

rj W tJ -EsmO(t) r J 8o tanfl 

s 2 



(43) 



1 sing(r) 1 1- 

= — log . ' = E - - log — — 
t smflo t 1 + e 



(44) 



-2Et„2 
S 



and it is easy to see that a T E \—E,E\. 

The distribution P® q (a T ) is computed imposing uniform distribution over the inital data #o- Then, 
inverting (44) to express so as a function of a T , we have 
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It is easy to check that this distribution verifies P® q {— a T ) = e TaT Pl q [o T ) for any finite time and that in the 
limit t — > oo we have, for p = <r T /E e (—1, 1), 



Coo(p) = lim - log P e eq (pE) = E V —± , 

r—foo T L 

such that Coo(^) — Coo(— p) = Ep = <7+p, i.e. the limiting distribution verifies the fluctuation relation. 



(46) 
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The stationary state corresponds to 6 = 0, then s — and a T — E; thus the distribution 

P e st {a T )=5{a T -E) (47) 

trivially violates the fluctuation relation. 
4.1.2 Another simple example 

In the previous example the stationary state function Coo(p) does not exist because P st is singular. One 
might argue that this pathology is responsible for the difference between Coo and Coo ■ However this is not the 
case. Consider as an example 16 a system whose state variable is an independent pair (x, 0) with 8 evolving 
according to (42), and x describing a reversible Anosov system. The phase space contraction rate is 

E(x,9) = Ecos6 + o x (x) , (48) 

where a x (x) is the contraction rate of the Anosov system. The distribution of S T = r _1 J Q T dt[E cos 6(i) + 
<j x (x{t))] is given, if initial data are sampled according to the volume measure, by 

P e9 (£ T ) = J do*P° q (X T o%)P* q {o%) , (49) 

with Pg given by (45). Given that both P® q and P x q verify the transient fluctuation relation (10), it 
follows easily that P eq (T, T ) verifies the same relation for any finite r and consequently the limit distribution 
Coc(p) = linv^oo t -1 logP eg (pS + ) verifies the fluctuation relation. Conversely, the distribution in stationary 
state is 

Pst (Zr )=J do* P 6 s t (S T - °%)P x si K) = J do*6(X T -a%- E)P* «) = P* (S T - E) , (50) 

and, given that P* t verifies the fluctuation relation, it is easy to see that the relation is not verified by 
P5t(S T ), neither for finite r nor in the limit r — > oo (this statement can be checked for instance assuming 
that Pf t is a Gaussian). 

To summarize, in this example the distribution t _1 log P eiJ (S T ) converges fast enough to a limiting 
distribution Coo(p) verifying the fluctuation relation. The true distribution Coo(p) = lim^oo t _1 logP st (S T ) 
exists, is analytic, but is different from Coo(p) and in particular does not verify the fluctuation relation. 

This example shows that the limiting procedure involved in passing from the transient fluctuation relation 
to the stationary state fluctuation relation is very subtle. In this example nothing seems to go wrong, all 
the functions are smooth and convergence is fast, but the true stationary state distribution is different from 
the limit of the distribution P eq . This kind of subtlety is peculiar to systems that do not display a chaotic 
behavior at least on some subset of the state variables 17 , and might be very difficult to detect in a numerical 
experiment. 

4.2 Hidden time scales 

The strange behavior of the examples above can be related to the existence of a hidden very large time 
scale 18 . This time scale can be revealed by adding a small noise term of variance e to Eq. (42). In presence 
of the noise, the system is able to explore the full phase space and the fluctuation relation holds in stationary 
state for any finite e. However, in the limit e — > 0, the time needed is r e ~ e 1 / 6 . For r <C t £ the fluctuation 

16 This example was suggested by F.Bonctto. A very similar example was discussed in Appendix Al of [45]. 
17 Note that the system is chaotic in the sense of having at least one positive Lyapunov exponent, but the attracting set is 
not dense in the phase space (x, 9) being concentrated on 9 = 0. 
18 The content of this section is based on ideas of J. Kurchan [68]. 
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relation is violated, while for r 3> t £ it is recovered. Clearly in the limit e — > 0, r £ — > oo and the fluctuation 
relation is violated for all r in stationary state [68] . 

As a simple example we consider a two-state system described by a spin variable St E { — 1,1}. The initial 
spin is chosen from -P(so) oc e hs ° , then the transition rate K(st, St+i) is proportional to e' ls *+ 1_/3 ' 7 ( s *' St + 1 ), 
where J(+, — ) = 1 and J = otherwise, i.e. J(s,t) = 5 Si+ 5t,-. Then the probability of a trajectory 
s = (s , ■ ■■ ,s T ) is given by 

P[s] oc e^SLo^-ZJEfco 1 J(«t,at+i) ) (51) 

i.e. the dynamical system 19 corresponds to a one-dimensional Ising chain where only pairs (+, — ) give a 
contribution 1 to the energy. 

Define the time reversed trajectory s — (— s T , • • • , — s ); then 

P[a] = e - 2h ^=° Si P[s] = e-™'l s lp[s] , (52) 

and this implies the fluctuation relation for the distribution of 

<rAs} = — X> , (53) 
T t=o 

for any finite /?. 

For infinite /3 the transition (+, — ) is forbidden: therefore, in stationary state only the trajectory s = 
(+, +, • • • , +) is possible. Thus P s t(<7 T ) = S(a T — 2h) and it does not verify the fluctuation relation, as in 
the examples discussed above. Note that transient trajectories of the form (— , • • • ,—,+,••• , +) are allowed, 
if initial data are extracted according to P(so) oc e hs ° , and the relation (52) holds for these transient 
trajectories. Then the transient fluctuation relation holds for P eq also for — oo at any finite r, and in the 
limit t — > oo . This is exactly the same situation we already discussed in section 4.1.1. 

The limit of very large but finite is particularly interesting: in this limit, the Ising chain (51) develops 
a large correlation length. Consider a segment of trajectory of length i beginning with +. The segment 
(+, • • • , +) has probability 0(1). Segments ending in the state — have at least one interface (+, — ) which 
can be everywhere in £, then their probability is <~ le~P. Therefore if one wants to observe a jump (+, — ) 
the segment of trajectory must have length I ~ e' 3 . This is the time we need to wait if we want to observe 
a jump to the state — which is needed to observe the fluctuation relation. We conclude that if r <C e 13 the 
fluctuation relation will be violated, while if r » e@ it will be verified. 

In the limit — > oo, the time scale diverges and this is why the fluctuation relation is violated also in 
the limit t — > oo: the limits — > oo and r — > oo cannot be exchanged [68]. 

4.3 Transitive Axiom C attractors 

The system (48) is a particular case of a more generic situation in which the motion of the system on its 
attractor can be described by a transitive Anosov system. If the system is reversible, and there is a unique 
attractor A+ and a unique repeller A- (i.e. an attractor for the time-reversed dynamics), the system is 
called an Axiom C system. Details can be found in [45, 44]. 

An extended version of the chaotic hypothesis is that, if the attracting set is not dense in phase space, 
the system can be regarded as an Axiom C system, i.e. the motion on the attracting set is described by 
an Anosov system, at least for the purpose of computing the physically interesting quantities. The SRB 
measure describing the stationary state will be given by an expression similar to (3), with the expansion 
rates computed on the attractor, times a "delta function" enforcing the constraint that the system is on the 
attractor. 

19 The reason why we call (51) a dynamical system is that the SRB measure becomes a Gibbs measure for an Ising chain 
when computed on a Markov partition. Therefore (51) is one of the simplest measures of the form (3). Alternatively one can 
think to this system as a Markov process. 
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In such systems, the phase space contraction rate can be written as in (48): 



£(x) = (T A (x) + <T±(x) , 



(54) 



where <ta( x ) describes the phase space contraction rate on the attractor, while a±(x) describes the part which 
is orthogonal to the attractor. One could then be interested in measuring the fluctuation of <ta( x ) to test 
the extended chaotic hypothesis, that implies that P(oa) verifies the fluctuation relation. Unfortunately, 
in general the attractor will be a complicated manifold and the explicit construction of <ja( x ) might be 
impossible. In particular, while £(x) is related to the entropy production rate, it is not obvious to relate 
o~a{x) to an experimentally accessible quantity. 

A possible strategy to access <ja{ x ), that works under some additional hypothesis, has been proposed 
in [45, 44]. Unfortunately for the moment a numerical verification of this proposal is not possible due to 
prohibitive computational costs [49], so we will not discuss this matter in more detail here. 

4.4 Summary 

We discussed some examples in which the transient fluctuation relation holds but the stationary state fluc- 
tuation relation is violated. This is due to the fact that (a subset of the) system is not chaotic, and might 
be related to a diverging "hidden" time scale whose existence can be revealed by adding a small noise [68]. 

5 Irreversible systems 

In the previous section we discussed some possible violation and/or modification of the fluctuation relation 
for systems that violate the requirements of smoothness, chaoticity and ergodicity/transitivity. The last 
ingredient which is required, as discussed in section 1.6, is the requirement of reversibility. This requirement 
is crucial: indeed, we see from Eq. (6) that the fluctuation relation compares the probability of trajectories 
x(i) having positive entropy production rate with the probability of their time reversed Ix(i) having negative 
entropy production rate. If the system is not reversible, nothing guarantees that the latter exist: the entropy 
production could be always positive and obviously Eq. (6) docs not make sense in this case. 

5.1 Reversible and irreversible models 

We will now discuss some examples [45, 69, 70] in which one can construct two models, one reversible and 
the other irreversible, that seems to describe the same physical system. We will see that the result for the 
large deviation function of the global entropy production rate, Coo(p), is very different. We will then discuss 
in what sense the two models might be equivalent. 

5.1.1 Reversible Gaussian thermostat and constant friction thermostat 

A class of reversible models which is believed to describe noncquilibrium systems are based on Gaussian 
thermostats. Consider a system of particles described by their position and momenta (pi, q{) with equations 
of motion 



Here V{q) is the potential energy of interaction between particles, and F(q) represents an external driving 

force that does not derive from a potential and injects energy into the system. The Gaussian multiplier 

2 

a(p, q) is defined by the condition that the total energy H (p, q) — J2i ^ + V(q)j or the kinetic temperature 
T(p) = ^ are constant. In the former case one has 



mqi = pi , 
Pi = -d gi V(q) + Fi(q) - a(p, q)p t . 



(55) 



a(p, q) = 



(56) 
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Models belonging to this class are often used to model electric conduction 20 , shear flow, heat flow, and so 
on, see e.g. [51] for a review. The equations (55) are reversible, the time reversal being simply I(p(t), q(t)) = 

(-p(-t),g(-t)). 

The phase space contraction rate for this equation is easily computed and gives 

m-^ dp, dq, r>fAT-i\ Y,iii F i{Q) W(p,q) 

°M = ^ + ^ = NdaM + ° {N ) = pvFE^> = %T' (57) 

where W(p, q) is the power injected by the external force. This result is an example of the identification 
between phase space contraction rate and entropy production rate we already discussed 21 . 

The Gaussian multiplier a(p, q) is a quantity 0(1) in the thermodynamic limit, and we expect its fluc- 
tuations to be 0(1/ VN). Therefore, in the thermodynamic limit, we can replace a(p,q) by its average 
v = (a(p, q)}, in the equation of motion (55): 

""IQi = Pi , , % 

(58) 

Pi = -d qi V(q) + Fi(q) - vpi . 

But if we do this, the new equations arc not reversible! Moreover, the phase space contraction rate is now 
°V (p, q) = Ndv and is always positive, so certainly the fluctuation relation does not hold for this quantity. We 
can ask whether the phase space contraction rate of the reversible equations, given by (57) and identified with 
the entropy production rate, still verifies the fluctuation relation if studied using the irreversible equations 
of motion (58). This is not the case. We have, from the definition of H, 

£-£*«(«) -Eg. <») 

i i 

Using this relation, Eq. (57) becomes 

a(p, q ) = Ndu + ^ d -§. (60) 

The first term is always positive. For large N, the second term averaged over a time r is roughly equal to 
tW§)) an< ^ wu ^ vamsn f° r T ~~ * 00 if H i s bounded 22 . Therefore the integral of cr(p,q) in (57) is always 
positive for large r, and the fluctuation relation cannot hold for this quantity. 

We conclude that the large deviations of a(p, q) are different in the two cases, and in the irreversible case 
do not verify the fluctuation relation. 

This is not surprising, since cr(p, q) is a global quantity, and it is well known in equilibrium statistical 
mechanics that global quantities can have very different behavior if computed in different ensembles . 

Instead, the two equations (55) and (58) might be equivalent for the purpose of computing properties of 
local observables, i.e. quatities that depend only on the particles which are in a small box inside the system, 
in the thermodynamic limit. In this case the equivalence might hold also for large deviations. We will discuss 
this point in the following sections. 



20 In this case one obtains a reversible version of the Drude model. 

21 Note that if instead of fixing the total energy we fix the kinetic temperature, the value of a and a change by a term 
proportional to T ^ Thus these two ensembles produce equivalent fluctuations of a, in the thermodynamic limit, only if 
the potential V is not singular. In presence of singularities (e.g. for a Lcnnard- Jones potential) one has to apply the prescription 
discussed in section 3 to remove the singular term. This has been shown in [48]. A discussion of the difficulties that one has to 
face to give a mathematical proof of the equivalence is in [71]. 

22 If the function H is not bounded, the fluctuations of H will introduce spurious contributions but still the fluctuation relation 
will not hold for cr(p,q). 

23 The same happens in equilibrium statistical mechanics: for instance the global energy fluctuates in the canonical ensemble 
and obviously does not fluctuate in the microcanonical ensemble. 
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5.1.2 Turbulence and the Navier-Stokes equations 

Another interesting example is the case of the Navier-Stokes equations. These equations are not reversible. 
However, it has been conjectured in [69, 23] that the Navier-Stokes equations might be equivalent, in some 
situations, to reversible equations. The equivalence is in the same spirit discussed in the previous section. 

Numerical results supporting this conjecture have been given in [56, 57, 58], where the validity of the 
fluctuation relation has been numerically verified for the reversible equations. 

Note that, an argument similar to the one discussed in the previous section [70] leads to the same 
conclusion, that the fluctuation relation cannot hold for the global entropy production in irreversible Navier- 
Stokes equations. 

5.1.3 The granular gas 

The case of granular gases is particularly illustrative. A granular gas is a system of macroscopic particles 
(typically of radius R <~ 1 mm and mass m ~ 1 mg) in a container of side L interacting via inelastic collisions 
(typically with restitution coefficient r ~ 0.8-^0.9) and having a large kinetic energy ^m(v 2 ) 3> mgR, where 
g is the acceleration due to gravity. Energy is injected into the system by shaking the box or vibrating one 
of its sides. This system behaves like a gas of hard particles but dissipation is present due to the inelasticity 
of the collisions. 

The first experiment on such a system [8] considered a window of smaller side I inside the box and 
measured the flux P T of kinetic energy entering the window during a time lapse r. The latter seemed to 
verify a fluctuation relation at least for small p = P T / {P T ). Note that the kinetic energy flux can be written 

as 

P T = AE + D T , (61) 

where AE = E(t) — E(0) is the variation of kinetic energy inside the window, while D T is the energy 
dissipated into the window by inelastic collisions during time r. 

A very reasonable model for the inelastic collisions between particles [72, 73, 65, 70] is to assume that 
the velocity component parallel to the collision axis is rescaled by r, in such a way that the relative variation 
of kinetic energy in the collision is f — r 2 > 0. In such a model, the dissipation D T is always positive, and 
phase space always contracts so that a > 0. It was then recognized [72] that, as D T > in this model, 
the fluctuation relation cannot hold for P T . Indeed, if AE were bounded, the large deviation functions of 
P T and D T would coincide implying that the probability of observing a negative fluctuation of P T is zero 
for large r. However, AE is not bounded, and its probability distribution has an exponential tail, see the 
discussion in [65]; the method described in section 3 can be applied and shows that even if D T > 0, P T can 
have negative fluctuations. This explains why such fluctuations were observed in [8]. The apparent validity 
of the fluctuation relation observed in [8] is probably explained by the smallness of the interval of p that was 
accessible to the experiment, in such a way that the function Coo(p) — Coc(— p) appeared as linear in p. 

The conclusion is that the fluctuation relation does not hold for the quantities D T (obviously) and P T (by 
(61) and the results of section 3), at least if the model for inelastic collisions used in [72, 73, 65] is accepted. 

One can easily construct a model of reversible inelastic collisions, in which in a collision particles can gain 
or loose energy, such that on average energy is lost but the dynamics is reversible 24 . Such a model should 
give similar results for average quantities but will give a different result for the large deviations of P and D, 
which will now probabily verify the fluctuation relation. 

5.2 The time scale for reversibility 

Different models, reversible or irreversible, give different results when one computes large deviations of global 
quantities, and in particular the fluctuation relation holds for the reversible models but does not hold for 
the irreversible ones. 

24 F. Bonetto, private communication. 
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If we want to investigate the fluctuations of the global entropy production rate, we have to ask, given 
a physical system on which we are performing a measurement, what is the more appropriate mathematical 
model, between the reversible and the irreversible ones, to describe its properties? 

Let us discuss this problem for the granular gas we modeled above. It is reasonable that the real 
system is described, at an atomic level, by reversible equations of motion (i.e. the Newton equations for the 
atoms constituting the macroscopic particles). In a collisions, the atoms interact in such a way that kinetic 
energy of the particles is dissipated by heating the two particles. Morover sound waves can be emitted as the 
experiment is performed in air 25 . Thus, in principle, it can happen that two particles, while colliding, absorb 
a sound wave and/or cool spontaneously in such a way that the kinetic energy is augmented by the collision. 
Clearly, the probability of such a process is very small: it is of the order of exp(— N), where N <~ 10 23 is 
the number of atoms constituting a particle 26 . Thus, on the experimental time scale r <C r exp(iV) one can 
safely neglect this possibility, the system is well described by the model in which velocity are rescaled by 
r at each collision, and the fluctuation relation does not hold. However, if one could imagine to wait for a 
time r 3> Toexp(7V), the fluctuation relation should be observed to hold. In this case the violation of the 
fluctuation relation is related to a mechanism very similar to the one discussed in section 4.2, i.e. it is related 
to an "hidden" diverging time scale (the time scale on which reversibility of the collisions can be observed) . 

When the time scale needed for observe reversibility is not larger that the experimentally accessible time 
scales, the system should be well described by reversible models. 

5.2.1 A proposal for an experiment on granular gases 

An example of this procedure was discussed in [74] . We considered a two dimensional granular gas contained 
in a box; the bottom of the box is vibrated while the other sides are fixed. This geometry is different from 
the one considered in [8] where the whole box is vibrated. 

In this situation, a temperature profile is established in the system; kinetic energy is injected at the 
bottom and starts to flow trough the system towards the top of the box, being dissipated in the meanwhile 
by inelastic collisions. The energy dissipated by collisions is again always positive, and obviously cannot 
satisfy the fluctuation relation. 

However, one can look to a different quantity, namely the flow J T of energy through a small portion 
of the system located between two lines at height h, h + S. This quantity is not always positive and 
cannot be expressed as a positive quantity plus a total derivative, as we did for P T in (61). We argued 
that, in a suitable quasi-elastic limit [74], see also [75], the portion of the system between h, h + S can be 
thought as being equivalent to a system of elastic hard particles in contact with two thermostats at different 
temperatures 7+ , T_ . This system can be described by a reversible model [74] . This analysis predicts that 
the fluctuation relation should hold for J T at least in a suitable quasi-elastic limit. It should be possible to 
verify experimentally (or at least numerically) this prediction. 

5.3 Ensemble equivalence 

As we discussed in the previous section, if one looks at global quantities (e.g. in a numerical simulation), 
the result might depend on the details of the model which is assumed to describe well the system under 
investigation. In particular, the choice between a reversible and an irreversible model should be motivated 
by a careful analysis of the involved time scales. 

However, this is not so natural. In fact, one of the main results of equilibrium statistical mechanics is 
that the relation between the interesting observables (pressure, density, energy, ...) are independent of the 
particular ensemble one chooses to describe the system. And indeed, in a real experiment, it is very difficult 
to distinguish between the system under investigation and the surrounding environment, and to make a 

25 Indccd the experiment is very noisy. 

26 An argument supporting this scaling comes from the ideas of Kurchan [68] discussed in section 4. Indeed, fluctuations of 
the internal energy of the two particles are 0(iV -1 / 2 ). We can consider them as a small noise of variance e oc N^ 1 acting 
during the collision; then in the limit of small noise the time scale is r = r^e 1 / 6 = roe^. 
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detailed model of the interaction between them. For instance, one would clearly like that the result does not 
depend on the details of the particular device one uses to remove heat from the system, and so on. 

To avoid such difficulties, one should consider the system under investigation as a subsystem of a larger 
system, including the thermostats, and show that the relations between the interesting observables of the 
subsystem do not depend on the details of the model one chooses to describe the whole system. This would 
also justify the use of phenomenological models like (55), (58), which are clearly unphysical as one assumes 
that there is a "viscous" term acting on each atom of the fluid. 

Proving equivalence of ensembles in nonequilibrium is much more difficult than in equilibrium, because in 
the former case dynamics is important in defining the ensembles: as the SRB measure (3) explicitely depends 
on the dynamics of the system. For this reason no exact results are available, but only conjectures [76, 22, 23] 
and arguments supporting them [71, 77], see in particular [77] for a detailed discussion. 

The equivalence between two different nonequilibrium ensembles can be defined as follows. We consider 
as an example the two models (55), (58); the equations of motion depend on a number of parameters, such 
as the density of particles, the interaction potential V(q), the external forcing F(q) etc.; having fixed these 
parameters, the model (55) depends on the value of the energy E = H(p,q), while (58) depends on the 
value of v. The corresponding ensembles £ E , £ v are the collection of the SRB distributions that describe the 
stationary states of (55), (58) at different E or v, respectively. 

Consider now a volume Vq inside the container V in which the model is defined, and a set 0(Vq) of 
observables Oy (p, q) that depend only on the positions and momenta of the particles inside Vq- 

The equivalence of £e, £ v means that in the limit N, V — > oo at fixed N/V it is possible to establish a 
one-to-one correspondence between elements o/£e and£ v such that the averages of all observables O G 0(Vq) 
are equal in corresponding elements. 

In this example, the element \i v G £ v corresponding to the element e is defined by the condition 
that 

v{E) = (a(p, q)) E = J dpdq fj, E (p, q)a{ Pl q) , (62) 
and conversely the element he corresponding to a given \i v is defined by 



E{v) = {H{p,q)) v = J dpdq^(p,q)H(p,q) . (63) 
Equivalence of the two ensembles means that for all local observables Oy G 0{V a ) it holds 

mv Um Mn A°yo(P^)),(E) = NV lim NI AO Vo {p,q)) E . (64) 

N , V -^oo , p— N/ V - ' N, V -^oo , p— N/ V 

In the case of the two ensembles defined by (55), (58), the equivalence is supported by the concentration 
argument discussed above, namely that for N, V — > oo the fluctuations of a(p, q) in the isoenergetic ensemble 
vanish, sec [71, 77] for more detailed discussions. 

5.3.1 A local fluctuation theorem 

Given the definitions above, it is natural to try to define a local entropy production rate and prove for this 
quantity a local fluctuation theorem. In the example (55), (58), a possible definition is a local version of (57): 

/ \ ^imevo Qi F i(l) W Vo (p,q) 

av ° (p ' q) = Wr^-^U^ = ' (65) 

Then, the equivalence conjecture (64), applied to the average of Oy (p, q) = e A<Tv o(p^) (that generates the 
probability distribution of ov ), implies that the large deviations function (v (p) is the same in the two 
ensembles, in the thermodynamic limit. In principle this function can satisfy the fluctuation relation (8) 
even if one of the two ensembles is irreversible. This would be a very interesting result. Unfortunately, from 
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the theoretical point of view the problem is very difficult. A tentative theory has been discussed in [78] and 
a numerical verification on a simple model of coupled maps has been reported in [79] . 

In numerical simulations of more realistic models, see e.g. [54], the results are much more difficult to 
interpret. As we discussed in section 2, even in the case of a "perfect" system (i.e. smooth, reversible, and 
transitive) the verification of the fluctuation relation is very difficult as long as the number of particles is 
bigger than ~ 20. Taking the limit of N — > oo means that one should look at a subsystem of ~ 10 particles in 
an environment of, say, ~ 1000 particles, and this, at present, has prohibitive computational costs. Moreover 
if the volume Vo is too small, as required by numerical simulations, one has to take into account nontrivial 
terms in the entropy production rate, related to the fluxes (of particles, energy, entropy) across the surface 
of Vo, see [74] for a tentative discussion of this problem. 

On the other hand, looking at local quantities is very natural in experiments, see e.g. [2, 3, 5, 8]. However 
the interpretation of these experiments is not clear for the moment (see [77] for a tentative interpretation 
of [2]), especially because the relation between the measured quantities and the local entropy production is 
not straightforward. The presence of large tails in the measured distributions suggests also that unbounded 
terms are affecting the measurements; it would be very interesting to try to apply the analysis described at 
the end of section 3 to these data. 

5.4 Summary 

The fluctuation relation cannot hold, even locally, if the time scale to observe reversibility exceeds the 
experimentally accessible time scale. Nevertheless, there are system that might be well described by reversible 
equations of motion, if this time scale is not too large. In these cases the fluctuation relation should hold. 
Clarifying this issue is clearly of fundamental importance to interpret experiments on real nonequilibrium 
systems. 

It is worth to note that even if the fluctuation relation does not hold due to irreversibility, this does 
not mean that the chaotic hypothesis does not apply. If the system is chaotic and smooth enough, still the 
stationary state should be described by the SRB measure (3). It would be very interesting to derive from the 
chaotic hypothesis, using the measure (3), other relations, independent of reversibility, that could be tested 
in experiments. 

6 Conclusions 

To conclude, we will briefly summarize the main points discussed in this paper. 

1. The chaotic hypothesis states that the fluctuation relation will be generically verified by models that 
are reversible, smooth, chaotic and transitive. 

2. For such models, numerical simulations have confirmed the validity of these predictions. 

3. A test of the fluctuation relation, even in models verifying the hypotheses above, is made difficult 
by the necessity of observing negative values of p. In particular, one should check that the function 
£ T (p) = - logP(per + ) is independent of r for p € P. The interval P must contain the origin. 

4. Negative fluctuations can be enhanced by reducing the system size N, the observation time r, or 
the strength of the applied field. The observation time r cannot be reduced arbitrarily because the 
fluctuation relation holds only for t 3> To, tq being the characteristic decorrelation time of the system. 
Eventually we can eliminate some finite r corrections, e.g. by shifting p in such a way that the maximum 
of ( T (p) is assumed in p = 1 . 

5. If the applied field is so small that the system is close to equilibrium, and if the distribution turns out 
to be Gaussian over the whole accessible interval in p, we are not verifying the fluctuation relation, 
because in this case it is equivalent to the Green-Kubo relations. 
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6. It is important to check not only that Coo(p) — Coc(~ p) oc p, but also that the proportionality constant 
is a + . The linearity in p could simply be due to the smallness of the accessible interval P. 

7. The presence of singular terms in a {non-smooth systems) can change dramatically the behavior of 
C(f>)- These terms manifest in anomalous large tails; in these cases the function £(p) might not be 
convex. If these terms are present, one should remove them by applying the procedure discussed in 
section 3. 

8. For systems that are not chaotic or not transitive any kind of strange behavior can (and has) been 
observed. For this reason a test of the fluctuation relation is interesting: it supports the validity of the 
chaotic hypothesis, i.e. that the system is chaotic and transitive (but obviously does not prove these 
properties). 

9. If one looks to the global entropy production rate, the fluctuation relation will not hold for irreversible 
systems. 

10. However, it is possible that different models which are globally reversible or irreversible, might be 
equivalent when observed on a local scale. If this is the case, a local fluctuation relation might hold 
independently of the model (reversible or irreversible) one chooses to describe the system on large scale. 

11. This is very important for the interpretation of experiments. If the results were found to depend 
strongly on the details of the model, one would have to take into account all the details of the system, 
including the thermostat, etc. 

Hopefully new experiments will be able to clarify the many open problems, in particular the last point. 
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A Lyapunov exponents and the phase space contraction rate 

We considered a system described by the equation of motion x = F(x) where x — (xi, ■ ■ ■ ,Xn). Consider 
a segment of trajectory x(i), t £ [0, r] (time can be continuous or discrete). The Jacobian matrix of the 
trasformation dx(0) — ► dx(r) is given by 

has eigenvalues Sj[x(t)], i = 1, • • • , N. The Lyapunov exponents can be defined as 

A i [x(t)] = ilog|s i [x(t)]| . (67) 
r 

Thus positive Lyapunov exponents correspond to expanding directions, while negative exponents correspond 
to contracting directions. 
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The expansion factor is 

A u [x(t)] = e ^ 4 , + ^[x(t)] = | de taS ij [x(t)] u | , (68) 

where the sum ^ , is restricted only to positive exponents and the determinant is restricted to the unstable 
directions (corresponding to the positive exponents). The contraction factor is 

A a [x(i)] = e'-Ei.-^W*)] = |detaS tf [x(t)].| , (69) 

so that the total phase space contraction rate is 

e -r«T[x(t)] = e rZ: ( A,[x(t)] = | det ftSy [x(t)]| = A„[x(i)]A 5 [x(i)] . (70) 

For very small times 27 t = dt and initial datum x we have 

dSij [x, dt] = <% + dt-£-± , (71) 



so that 



and (70) becomes 



Then it follows 



|det5S y [x(t)]| = l + <ft^^-^ + 0(dt 2 ) . (72) 



tr[x(t)] = -- / cttV • F(x(t)) . (74) 
T Jo 

For the time reversed trajectory x(r — i), it is easy to see that 

dSiMr t)} = g|| = [^[x(i)]]- 1 , (75) 

and the sign of the Lyapunov exponents are exchanged 28 , as it is needed for the derivation of (6). 

Finally note that a(x) is defined by -^dx = —a(x)dx. Then, if instead of dx we use the measure e~ v ^dx, 
it is easy to see that 

a'(x) = ^— ^e- ip(x) dx^a(x)+ip , (76) 

v > e-^dx dt v ; 

i.e. a change of metric changes a(x) by a total derivative. 



B The transient fluctuation relation 

Here we sketch the derivation of Eq. (10) for the microcanonical ensemble. The probability P e9 {cr[x(i)] = a} 
is proportional to the volume Vq of initial data Xq such that the subsequent trajectory gives a total phase 
space contraction <r[x(t)] = a. Then, by definition of the phase space contraction rate, the volume of the 
set of the final points x T — x(t) will be V T — e TCT Vo- Each point 29 x(t) is the starting point of a time- 
reversed trajectory having phase space contraction rate u[x(t)] = — a, so the volume V T is proportional to 
the probability P eq {a[x(t)] = — <r}, and Eq. (10) follows. 

27 For discrete times clearly the following derivation is not possible and one has instead tr[x(t)] = — i ~^2tZo 1°6 I det dS(x)\. 

28 Note that the time reversal might involve a transformation U such that U 2 = 1, i.e. the time reversed trajectory is 
Ix.(t) = Ux(t — t). For instance in the case of an Hamiltonian system one has to change sign to the momenta. In this case 
9S|/x] = UdS [x] -1 f7 and if V{ is an eigenvector of 95 [x] with eigenvalue Sj, then Uvi is an eigenvector of dS [/x] with eigenvalue 

29 Or eventually Ux(r), but as U 2 = 1 the map U conserves volume. 
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C A precise definition of Anosov systems 



An Anosov system is denned as follows [24]. Given a compact, smooth and boundaryless manifold M (phase 
space), a map S G C 2 (M) is an Anosov map if: 

(1) For all x £ M the tangent plane to M in x, T x , admits a decomposition T x — T^Q)T^, such that 

(2) the planes T x ' u vary continuously w.r.t. x, i.e. the vectors defining them are continuous functions of x; 

(3) the angle between and T", defined as the minimum angle between a vector in and a vector in T", 
is not vanishing; 

(4) defining OS the linearization matrix of S in x, i.e. S(x + ev) = S(x) + e dS(x) -v + 0(e 2 ), x G M, v G T x , 
e e K small, the planes T*' u are conserved under 5, i.e. if v G T^ u , then dS^) ' u € T^"; 

f5j for all x e M and for all v e T x one has |SS'(a;) fe t;| S fe a . < A _fe C|w| 2; , while for all v e T x one has 
\dS(x)~ k v\s-k x < A~ k C\v\ x , | • \ x being the norm on T x , for some constants C, A > 0; 
(6) there is a point x which has a dense orbit in M. 

The hypotheses above imply that it is possible to identify two families of smooth manifolds M 8 ' u in M, such 
that T£' u are the tangent plane to M s ' u in x, and such that points on M s tend to converge exponentially 
while points on M u tend to diverge exponentially under the action of S. 

This means that for each x £ M there is a stable manifold M x such that for all y £ M x one has d(S k x, S k y) < 
A~ k Cd(x, y), and an unstable manifold such that for all y G one has d(S~ k x, S~ k y) < A~ k Cd(x, y), 
where d(x, y) is the distance on M, see Fig. 1. 



D Saddle point equation for the distribution (q 

This computation is reported in full detail as it has not been previously published. The saddle point equation 
corresponding to (33) are 

~Cw(PQ ~Vi + Vf) = f'(TVi) , ^ 
CwiPQ -Vi+Vf) =f'(TVf) . 

For large values of its argument, f(v) must be an increasing function, then f'(v) > 0. We will see that the 
saddle point values of Vi and Vf are small. Then, if pq > 1, the second equation (77) has no solution 30 , 
because Cw(pq) < 0; means that Vf will stick to the boundary of the integration region, Vf = 0. 
Conversely, for pq < 1, we have = 0. Finally, we get for pq > 1 

Cq(pq) ~ max |c^(pq - Vi) - ^/(rui)j , -C(v(^Q - w = /'(™») i (78) 

and for > 1 

Cq(pq) ~ max { Cw(pq + u/) - -f(rv f ) 1 , CV(PQ + w /) = /'( TW /) • ( 79 ) 

In the following we will discuss the behavior for pq > 1, the other case can be discussed in the same way. 
Superexponential tails 

First we assume that /(f) <~ Av a 7 a > 1, for v — ► oo, i.e. the probability of v decays faster than 
exponentially We have 

-CwiPQ ~ Vi) = aAirVi)*- 1 . (80) 

Clearly vi must be small, and we get Vi = t _1 [— ( w (pQ)/aA} 1 ^ a ^ 1 ' > cx r" 1 , and 31 Cq{pq) = Civ(pq)+0(t _1 ) 
for all p. 



30 The unconvinced reader can check this statement by explicit computation in simple cases. 

31 Note that in this case the result is inconsistent with the initial hypothesis that TVi is large. The computation has to be 
repeated by considering the full function f(v); we find Vi = r _1 (/') _1 [~Cw (pq)] s * u ' k t ~ 1 ' thus the result is unchanged. 
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Exponential tails 

For f(v) ~ Av 7 we get 

-CwiPQ - Vi) = A , (81) 

As Vi must be positive, and Cw{Pq) ^ s negative and decreasing on incresing pq, the latter equation has 
a solution only if Cw(pq) < — A otherwise = 0. Calling p + the solution of Cw(p+) = — A we § e t 
< =PQ-P+ and 

, , x /Cw(pq) P <P+ , , QO , 

(q(pq)=<s , , s ^ (82) 

{Cw(P+) - A{pq -p + ) P>P+, 

i.e. Cq coincides with ^ up to p = p + and then it is given by its continuation as a straight line with slope 
({ v (p + ). A similar result is obtained for pq < 1. 

Subexponential tails 

For f(v) ~ Av a , < a < 1, the saddle point equation becomes 

-C' w (pQ-v t )=T a - 1 Aavr 1 ■ (83) 

In addition to the previous solution Vi kt" 1 , we can look for a solution such that pq — Vi = 1 + e with e 
small. Indeed as C,' W {1) — the left hand side will be small, and as Vi ~ pq — 1 is finite the left hand side 
will also be small. We get then 

-CwiPQ - Vi) ~ -CwWiPQ -1-Vi) = T^Aav?- 1 > (84) 

and we recall that — C{y(l) > 0- A graphical analysis of the previous equation reveals that it has no solution 
as long as pq < p+; the value of p + is defined by the system 



-CwWiPQ ~ 1 - v i) - ■ — * 
C(l) = r«- 1 Aa(a-lK- 2 , 



whose solution is easily found 



* 1 — a 
Cw(l) 



T<*-Ma(a-1) 



(86) 



From the above expression we find that p+ — 1 <~ v* <~ t~^°. This is consistent with the hypotheses we 
made above. Note also that ru* is large consistently with the assumption that only the large tail of f(v) is 
relevant for this calculation. 

For pq < p + the saddle point equation has no solution, it follows that Vi = and (q(pq) = C,w{pq)- For 
PQ > P+i the equation has two solution. As discussed above, for finite pq — 1 one solution is V{ ~ t -1 while 
the other one is Uj <~ pq — 1 — e for small e > 0. The latter gives the maximum in (33) while the former is a 
minimum. The value of e is given by 

Aa(pQ - IT' 1 



• = -ttW- ■ (87) 



Substituting this value into (33) we get 



Cq(pq) ~ ^^e 2 - Ar a -\p Q - 1)« ~ -Ar a -\p Q - 1)" + O^^) , 

i.e. (q(pq) is dominated by the tail of P(v) and one gets Cq(pq) — * for all pq. 
For p < 1 , changing Uj — > , we get 

i-p_ = uif=s 

J 1 — a 



T°- 1 Aa(a-l) 



(89) 
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i.e. 1 — p_ = p + — 1 so that p_ and p + are symmetrically distributed around 1, and for pq < p_ 

C Q (PQ)--AT a - 1 (l- PQ r + 0(T 2 ^) . (90) 
To summarize, we obtain two values of p, p±, defined by \p± — 1| = 8 <~ r _3 ^: 

CiMi) 



1 — Q 

such that 



ia(a - 1)_ 

{(q{pq) = Cw(pq) \Pq~M<8 
\Cq(pq) = -At*- 1 \ PQ -l\ a |pg-l|>7 



(91) 



(92) 



for any finite 7 » 8. In the intermediate regime 8 < \pq — 1| < 7 the saddle point solution has to be 
calculated numerically to interpolate between the two regimes. 

Power-law tails 

If Vij have power-law tails for large v, P(v) ~ ^,we have f(v) ~ (3 \og(v), and the saddle point equations 
is for pq > 1 

-CV(PQ " «i) = — . ( 93 ) 
and as before we can expand the l.h.s. around pq — Vi ~ 1 as the r.h.s. is small: 

-C(l)(PQ-^-l) = ^-. (94) 

TVi 

A solution appears for pq > p+ where 



s 2 



Eq.(94) becomes 

vf - Vi( PQ -1) + - = Q (96) 
with solutions 

As before the correct solution is the largest one, 



and as before it behaves as 



v*(Pq) ~ PQ ~ 1 ~ 4 ^ _ ^ = PQ ~ 1 ~ £ (99) 
for large r (i.e. small <$). Substituting v*(pq) in Eq.(78) we get 

Cq(pq) = Cw(PQ - v*(p Q )) - Pt- 1 logK*( Pg )] ~ -^log[r(p Q - 1)] + 0{t- x ) (100) 

T 

Note however that constant shifts of the order of log(T)/r are present due to the factor r 2 in front of Eq.(32) 
and from the Gaussian fluctuations around the saddle-point. An empirical way to find the constant is to 
impose the continuity of C,q{pq), i.e. (q(p+) = (w(p+)- From this it follows 

(q(pq) = Cw(pq - v*{p Q )) - Pt- 1 \og[2v*{p Q )/8] - C w (l + 8/2) + Q W {1 + 8) . (101) 
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A similar calculation can be done for pq < 1 and leads to 

C(Po)~-^Iog[r|po-l|] (102) 

for large pq . 
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